Reinforcement Learning based cooperative longitudinal control for reducing traffic oscillations and improving platoon stability
نویسندگان
چکیده
Stop-and-go traffic poses significant challenges to the efficiency and safety of operations. In this study, a cooperative longitudinal control based on Soft Actor Critic (SAC) Reinforcement Learning (RL) is proposed address issue. The reward function carefully designed consider vehicle cooperation achieve three main objectives: safety, efficiency, oscillation dampening. A global performance metric for dampening evaluate developed RL other baseline models. Depending number preceding vehicles that can share maneuver information, two models RL-1 RL-2 are compared with human driven (HD) an adaptive cruise (ACC) model using HighD simulated data. It found information from additional vehicles, dampen shockwaves more efficiently. Specifically, decrease by 15%-36% 15%-42%, respectively, while HD amplifies 14–37%. ACC also but not as effective RL-2. methods further evaluated data collected commercial Model X vehicle. Compared in some controlled settings, better stop-and-go waves generating smaller growth, overshooting, average acceleration/deceleration rate change, suggesting they generalize well new similar environment. Finally, considering platoon different penetration rates. results show consistently outperform shockwaves.
منابع مشابه
Hybrid Cooperative Agents with Online Reinforcement Learning for Traffic Control
This paper presents the application of fuzzy-neuroevolutionary hybrid system with online reinforcement learning for intelligent road traffic management and control. Taking a step away from the conventional traffic control system, the hybrid system presents different methodologies in knowledge acquisition, decisionmaking, learning and goal formulation with the use of a three-layered hierarchical...
متن کاملReinforcement Learning For Adaptive Traffic Signal Control
By 2050, two-thirds of the world’s 9.6 billion people will live in urban areas [2]. In many cities, opportunities to expand urban road networks are limited, so existing roads will need to more efficiently accommodate higher volumes of traffic. Consequently, there is a pressing need for technologically viable, low-cost solutions that can work with existing infrastructure to help alleviate increa...
متن کاملReinforcement Learning Based PID Control of Wind Energy Conversion Systems
In this paper an adaptive PID controller for Wind Energy Conversion Systems (WECS) has been developed. Theadaptation technique applied to this controller is based on Reinforcement Learning (RL) theory. Nonlinearcharacteristics of wind variations as plant input, wind turbine structure and generator operational behaviordemand for high quality adaptive controller to ensure both robust stability an...
متن کاملFlow: Architecture and Benchmarking for Reinforcement Learning in Traffic Control
Flow is a new computational framework, built to support a key need triggered by the rapid growth of autonomy in ground traffic: controllers for autonomous vehicles in the presence of complex nonlinear dynamics in traffic. Leveraging recent advances in deep Reinforcement Learning (RL), Flow enables the use of RL methods such as policy gradient for traffic control and enables benchmarking the per...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Transportation Research Part C-emerging Technologies
سال: 2022
ISSN: ['1879-2359', '0968-090X']
DOI: https://doi.org/10.1016/j.trc.2022.103744